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Small Progress Measures is one of the classical parity game solving algorithms. For games with 
n vertices, m edges and d different priorities, the original algorithm computes the winning regions 
and a winning strategy for one of the players in 0{dm ■{n/\d/2\) ) time. Computing a winning 

strategy for the other player requires a re-run of the algorithm on that player’s winning region, thus 
increasing the runtime complexity to 0{dm- {n/\d/2\)^‘^l'^^) for computing the winning regions 
and winning strategies for both players. We modify the algorithm so that it derives the winning 
strategy for both players in one pass. This reduces the upper bound on strategy derivation for SPM to 
0{dm- {n/[d /2J) ) At the basis of our modification is a novel operational interpretation of the 

least progress measure that we provide. 


1 Introduction 

A parity game IS] [141123 is an infinite duration game played on a directed graph by two players called 
even and odd. Each vertex in the graph is owned by one of the players, and labelled with a natural 
number, called a priority. The game is played by pushing a token along the edges in the graph; the 
choice where to move next is made by the owner of the vertex on which the token currently resides. The 
winner of the thus constructed play is determined by the parity of the minimal (or maximal, depending 
on the convention) priority that occurs infinitely often, and the winner of a vertex is the player who 
has a strategy to force every play originating from that vertex to be winning for her. Parity games are 
positionally determined; that is, each vertex is won by some player |[T4l . and each player has a positional 
winning strategy on her winning region. Solving a game essentially means deciding which player wins 
which vertices in the game. 

Parity games play an important role in several foundational results; for instance, they allow for an 
elegant simplification of the hard part of Rabin’s proof of the decidability of a monadic second-order 
theory, and a number of decision problems of importance can be reduced to deciding the winner in 
parity games. For instance, the model checking problem for the modal /x-calculus is equivalent, via 
a polynomial-time reduction, to the problem of solving parity games ID [TH ; this is of importance in 
computer-aided verification. Winning strategies for the players play a crucial role in supervisory control 
of discrete event systems, in which such strategies are instrumental in constructing a supervisor that 
controls a plant such that it reaches its control objectives and avoids bad situations; see e.g. |T| and 
the references therein. In model checking, winning strategies are essential in reporting witnesses and 
counterexamples, see ifT^ . 

A major impetus driving research in parity games is their computational status: the solution problem 
lies in UP and coUP and is, despite the continued research effort, not known to be in PTIME. Deter¬ 
ministic algorithms for solving parity games include the classical recursive algorithm ll20l and small 
progress measures (SPM) algorithm ifTTl . the bigstep algorithm ifTSl . the deterministic subexponential 
algorithm ifT^ and a class of strategy improvement algorithms ifT^fT^ Bl. 

Strategy improvement algorithms were long perceived as likely candidates for solving parity games 
in polynomial time, but, save a recently introduced symmetric variant liTTl . they were ultimately proven 
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to be exponential in the worst-case p6|. The deterministic subexponential algorithm, running in 
where n is the number of vertices in the game, is a modification of the recursive algorithm which itself 
runs in 0{m ■ n‘^), where m is the number of edges and d is the number of different priorities in the 
game. Bigstep, which runs in 0{m- {Kn/dy^‘^'1), where fc is a small constant and y{d) k, d/3, is a 
combination of the recursive algorithm and the SPM algorithm. This latter algorithm solves a game in 
0{dm- (n/[d/2j)L‘^/^J). 

Somewhat surprisingly, our knowledge of classical algorithms such as SPM and the recursive algo¬ 
rithm is still far from complete. For instance, the recursive algorithm is regarded as one of the best 
algorithms in practice, which is corroborated by experiments Q. However, until our recent work [8| 
where we showed the algorithm is well-behaved on several important classes of parity games, there was 
no satisfactory explanation why this would be the case. In a similar vein, in ibid, we provided tighter 
bounds on the worst-case running time, but so far, no tight bounds for this seemingly simple algorithm 
have been established. We expect that if improvements on the upper bound on the parity game solving 
problem can be made, such improvements will come from improvements in, or through a better under¬ 
standing of the classical algorithms; this expectation is fuelled by the fact that these classical algorithms 
and the ideas behind them have been at the basis of the currently optimal algorithms. 

In this paper, we focus on Jurdzihski’s small progress measures algorithm. Using a fixpoint compu¬ 
tation, it computes a progress measure, a labelling of vertices, that witnesses the existence of winning 
strategies. In general, no clear, intuitive interpretation of the information contained in the progress mea¬ 
sures has been given, and the mechanics of the algorithm remain rather technical. This is in contrast 
to the self-explanatory recursive algorithm, and the strategy improvement algorithm, where, thanks to 
the ordering of plays according to profiles, at every step, one has a clear picture of the currently known 
best strategy. Apart from Jurdzihski’s original article, some additional insight was offered by Klauck 
and Schewe. In ifTOl . Klauck defines a specific parify progress measure for a solifaire game wifh only 
even simple cycles and explains fhaf if has a particular inferprefafion, buf his parify progress measure is 
nof generally related fo fhe measure computed by fhe SPM algorifhm (nol even in fhe setting of solifaire 
games). Schewe, in his paper on bigstep ifTSll . analyses progress measures wifh resfricfed codomains and 
shows how fhey can be ufilised fo efficienfly defecf small dominions. Our first confribufion is fo provide a 
better undersfanding of fhese progress measures and fhe intermediate values in fhe fixpoinl compufafion, 
see SecfionlH By doing so, a better undersfanding of fhe algorifhm if self is obfained. 

Progress measures come in fwo flavours, viz. even-and odd-biased, and fheir compufafion time is 
bounded by eifher 0{dm- (n/[d/2j ) or 0{dm- (n/[<i/2] ), depending on fhe parify of fhe 

exfreme priorities fhaf occur in fhe game. From an even-biased progress measure, one immediately 
obfains winning regions for both players, buf only a winning sfrafegy for player even on ifs winning 
region and nol for her opponenl. Likewise for odd-biased progress measures. Oblaining fhe winning 
sfrafegy for an opponenf fhus requires re-running fhe algorifhm on fhe opponenl’s winning region. Note 
fhaf fhe efforf fhaf needs fo be laken fo obfain a sfrafegy in fhe same lime bound as fhe winning region 
stems from a more general fealure of parity games: a winning parlilion in ilself does nol allow one fo 
efficienfly compufe a winning sfrafegy (unless Ihere is an efficienf algorifhm for solving parify games). 
This basic resull, which we neverlheless were unable fo find in fhe lileralure, is formalised in Secfion[5] 

An essential consequence of Ihis is fhaf fhe algorifhm solves a parity game in 0{dm • (n/ [d/2j) ), 

as one can always compufe one of fhe fwo types of progress measures in fhe shorler lime bound. Confrary 
fo whaf is slated in ifTTI . fhe same reasoning does nol apply fo computing fhe winning sfrafegy for a fixed 
player; Ihis still requires 0{dm- ) in fhe worsl case, as also observed by Schewe in ITSl . 

An inlriguing open problem is whelher if is al all possible fo derive fhe winning slralegies for bolh players 
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while computing one type of measure only, as this would lower the exponent in the time bound to [d /2J. 
Our second contribution is to give an affirmative answer to the above problem. We modify the generic 
SPM by picking a partial strategy when a vertex won by player □ is discovered, and subsequently adjust 
the lifting policy so that it prioritises the area which contains an n-dominion. Both steps are inspired by 
the interpretation of the progress measures that we discuss in Section |4l Like the original algorithm, our 
solution, which we present in SectionUl still works in polynomial space. Formal proofs of all results can 
be found in our technical report fQj. 


2 Parity Games 

A parity game is an infinite duration game, played by players odd, denoted by □ and even, denoted by 
O, on a directed, finite graph. The game is formally defined as follows. 

Definition 1 A parity game is a tuple {y,E,lP, (Vo, V□))^ where 

• V is a finite set of vertices, partitioned in a set Vo of vertices owned by player O, and a set of 
vertices Vb owned by player □, 

• F'CFxVisa total edge relation, i.e. all vertices have at least one outgoing edge, 

• l^'.V —)• N is a priority function that assigns priorities to vertices. 

Henceforth, we assume that O denotes an arbitrary player; that is O G {DjO}. We write O for O’s 
opponent: C> = □ and □ = O. Parity games are depicted as graphs; diamond-shaped nodes represent 
vertices owned by player O, box-shaped nodes represent vertices owned by player □ and the priority 
assigned to a vertex is written inside the node, see the game depicted in Figure [2 



Figure 1: A simple parity game with 4 different priorities, in which 4 vertices are owned by player odd 
and 2 vertices are owned by player even. 

Throughout this section, assume that G = {V,E, , (Vo, Vb)) is an arbitrary parity game. We write 

V —)■ w iff (v,w) G E, and we write v* to denote the set of successors of v, f.e {w G V | v —>• w}. For a set 
of vertices W C F, we will denote the minimal priority occurring in W with min^(lV); by V we denote 
the set of vertices with priority /; likewise for V<^. For a set A C V, we define GHA as the structure 
{A,Eri{AxA),IiP\A, (Vo nA, VbnA)); the structure G\A is defined as GH (V \A). The structures GHA 
and G \ A are again parity games if their edge relations are total. 

Plays and Strategies A sequence of vertices vi,..., v„ is a path if —>• v^+i for all 1 < m < n. Infinite 
paths are defined in a similar manner. We write tt, to denote the vertex in a path n. 

A game starts by placing a token on some vertex v G V. Players O and □ move the token indefinitely 
according to a single simple rule: if the token is on some vertex that belongs to player O, that player 
moves the token to an adjacent vertex. An infinite sequence of vertices created this way is called a play. 
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The parity of the least priority that occurs infinitely often on a play defines fhe winner of fhe play: player 
O wins if, and only if fhis priorify is even. 

A strategy for player O is a parfial function o.V^ —>• V salisfying fhaf whenever if is defined for a 
finife pafh u\---Un £ T+, bofh u„ G Vq and a{ui ■ ■ ■ u„) G «*. We denofe fhe sef of sfrafegies of player O 
by §Q. An infinite play u\ U 2 u^--- is consistent wifh a sfrafegy a if for all prefixes of fhe play 

for which a{ui •••«„) is defined, = o{ui - The sef of all plays, consisfenf wifh some sfrafegy 
a, and sfarfing in v is denofed n(a,v). Some sfrafegy a is winning for player O from verfex v iff all 
plays consisfenf wifh a are won by player O. Verfex v is won by player O whenever she has a winning 
sfrafegy for all plays sfarfing in verfex v. Parify games are determined |31, meaning fhaf every verfex 
is won by one of fhe players; fhey are even positionally determined, meaning fhaf if O wins a vertex 
fhen she has a winning positional strategy, a sfrafegy fhaf defermines where fo move fhe foken nexf 
based solely on fhe verfex on which fhe foken currenfly resides. Such sfrafegies can be represented by a 
funcfion a:Vo V, and fhe sef of all such sfrafegies for player O is denofed Sq- Solving a parify game 
G essentially means computing fhe partition (Wino(G), Winn(G)) of V info vertices won by player O 
and player □, respectively. 

Example 1 In fhe parify game depicted in Figure [H verfices vi,V 2 and V 3 are won by player O while 
vertices V 4 , V 5 and vg are won by player □. A winning positional sfrafegy for player O is fo play from V 2 
fo v\ and from V 3 fo V 2 . A winning sfrafegy for □ is fo move from V 4 fo vg and from V 5 fo vg. 

Attractors and Dominions An O-attractor info a sef 17 C V confains all fhose verfices from which 
player O can force any play fo 17; if is formally defined as follows. 

Definition 2 The O-attractor into a set 17 C V, denoted 0-Attr{U), is the least set A C V satisfying: 

1. l/CA 

2. (a) if w G Vq w* n A / 0, then w G A 
(b) if w G Vq and w* C A, then w G A 

Observe that the complement of an attractor set of any subset of a parity game induces a parity game, i.e. 
G \ 0-Attr{U ) for any U and O is a well-defined parity game. Whenever we refer to an attractor strategy 
associated with 0-Attr{U), we mean the positional strategy that player O can employ to force play to U ; 
such a strategy can be computed in ^(|V| + lEl) using a straightforward fixpoint iteration. 

A set of vertices U is an O-dominion whenever there is a strategy a for player O such that every play 
starting in U and conforming to a is winning for O and stays within U. A game is a paradise for player 
O if the entire game is an O-dominion. 

Example 2 Reconsider the parity game of Figure [T] We have 0-Attr{v2) = {v 2 ,V 3 } and □-Artr(v 4 ) = 
{v 4 ,V 5 ,vg}. The winning region {vi,V 2 ,V 3 } is an O-dominion, but the subset {v 2 ,V 3 } is not; the set 
{v 4 ,vg} is an D-dominion. 

3 Jurdzinski’s Small Progress Measures Algorithm 

The SPM algorithm works by computing a measure associated with each vertex. This measure is such 
that it decreases along the play with each “bad” odd priority encountered, and only increases upon reach¬ 
ing a beneficial even priority. In what follows, we recapitulate the essentials for defining and studying 
the SPM algorithm; our presentation is—as in the original paper by Jurdzifiski—from the perspective of 
player O. 
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Let G= {V,E, (Vo, Vn)) be a parity game. Let J be a positive number and let a € be a d-tuple 
of natural numbers. We number its components from 0 to d—\,i.e. a = (Oq, oti,..., ), and let < on 

such r/-tuples be given by the lexicographic ordering. We define a derived ordering <, on A:-tuples and 
/-tuples of natural numbers as follows: 

(ao,ai,...,aO <,■ (j8o,j8i,...,j8/) iff (ao,ai,.< (j3o,j8i,...,j8i) 

where, ili> k or i > I, the tuples are suffixed wifh Os. Analogously, we wrife a =, jS fo denofe fhaf a 
and j3 are idenfical up-fo and including position i. The derived ordering provides enough information fo 
reason abouf fhe inferesfing bifs of plays: when encountering a priorify i in a play, we are only inferesfed 
in how oflen we can or musf visif vertices of a more significanl {i.e. smaller) priorify fhan i, whereas we 

no longer care abouf fhe less significanl priorilies fhaf we shall encounfer; a more precise inferprelafion 

of fhe informafion lhat will be encoded will be given in Section |4l 

Now, assume from hereon fhaf <i — 1 is fhe largesl priorify occurring in G; i.e., d is one larger fhan 
fhe largesl priorify in G. For i E N, lei n, = |V|. Define C U {T }, as fhe largesl sel conlaining T 
(T ^ N^) and only Ihose d-luples wifh 0 on even positions and nalural numbers < n,- on odd posifions i. 

The lexicographical ordering < and the family of orderings <, on d-tuples is extended to an ordering 
on by setting a < T and T = T. Let p:V —)■ and suppose w E v*. Then Prog(p, v, w) is the least 

m E M^, such that 

• m >^(v) p(w) if ^(v) is even, 

• m p(w), or m = p{w) = T if is odd. 

Definition 3 Function p is a game parity progress measure if for all v E V: 

• if V E Vo, then for some w E v*, p(v) >^(v) Prog(p,v,w) 

• if V E Vb, then for all w E v*, p(v) >^(v) Prog(p,v,w) 

The following proposition is due to Jurdzinski ifTTI ; it shows that the least game parity progress measure 
characterises the winning regions of a parity game. 

Proposition 1 If p is the least game parity progress measure for G, then for all v E V: p (v) / T iff 
V E Wino(G). 

The least game parity progress measure can be described as the least fixpoint of a monotone transformer 
on the complete lattice we define nexl. Lei p,p':V —and define p IT p' as p(v) < p'{v) for all v E V. 
We wrife p C p' if p T p' and p / p'. Then fhe sel of all functions V —)• wifh T is a complete lallice. 

The monofone Iransformer defined on Ihis sel is given as follows: 

IT./ ^ [p[v^max{p(v),min{Prog(p,v,w) I v-^ w}}] if v E Vo 
Lift(p,v) = < 

I p[v i-> max{p(v),max{Prog(p,v,w) | v ^ w}}] if v E Vn 

As a consequence of Tarski’s fixpoinf Iheorem, we know fhe leasl fixpoinf of fhe above monofone Irans¬ 
former exisls and can be compuled using a slandard fixpoinf ileralion scheme. This leads fo fhe original 
SPM algorilhm, see Algorifhmdl Upon lerminalion of fhe ileralion wilhin fhe SPM algorifhm, fhe com¬ 
puted game parity progress measure p is used fo compute fhe sefs Wino(G) and Winn(G) of vertices 
won by player O and □, respectively. 

Theorem 1 Algorithm [T] solves a parity game in 0{dm • (n/ [r//2j) ), see ifTTll . 
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Algorithm 1 The original Small Progress Measures Algorithm 
1 : function SPM(G) 

2 : Input G = {V,E,^, {Vo, Vn)) 

3: Output Winning partition {Wino{G), Wino{G)) 

4: p ^ AvEP. (0,...,0) 

5: while p C Lift(p,v) for some v E P do 

6: p Lift(p,v) for some v E P such that p C Lift(p,v) 

7: end while 

8: return ({v E P | p(v) / T},{v E P | p(v) = T}) 

9: end function 


The runtime complexity of SPM is obtained by considering the more optimal runtime of solving a 
game G, or G’s ‘dual’; the latter is obtained by shifting all priorities by one and swapping ownership of 
all vertices (alternatively, a ‘dual’ algorithm can be used, computing with a domain M^). The runtime 
complexity for computing winning strategies for both players using the SPM algorithm is worse than the 
runtime complexity of solving the game. A winning strategy Oo'-Vo —)• P for player O can be extracted 
from p by setting Oo{v) = w for v E PonWino(G) and w E v* such that p(>v) < p(w') for all w' E v*. A 
winning strategy for player □ cannot be extracted from p a posteriori, so, as also observed in ifTSl . the 
runtime complexity of computing a winning strategy cannot be improved by considering the dual of a 
game (contrary to what is stated in ifTTI l. 

Theorem 2 [See also lfl5l l Algorithm [U can compute winning strategies for both players in 0{dm ■ 

As an illustration of the above observations, consider the family of games depicted in FigureH] The more 
optimal runtime of 0{dm • (n/ [<i/2j) ) will be achieved by solving the games themselves (using M^) 

and not their dual. As all games in the family are D-paradises, we cannot extract a winning strategy for 
player □ from the computed progress measure and the only option we have is to solve the dual games 
with the less favourable runtime of 0{dm- {n/\d/2\)^^/'^^ ). In fact, all instances of the family of games 
depicted in Figure |2] are solved exponentially faster than their dual, underlining the potential practical 
implications of re-running the algorithm. 


2N-2 


■ 2N- 1 


2N 


Figure 2: A parity game won by player □. Solving the game using M^, the first T value is reached after 
the first full pass of the cycle containing priority 1 {0{N^) using a fair lifting strategy), and it will then 
propagate through the game. Solving the dual game, or using takes exponential time to lift the node 
with priority 2N. 


To facilitate the analysis of SPM, we will use the following terms and notions. The term measure 
refers to the intermediate values of p in the lifting process as well. Given a tuple m E M^, we say that 
the position i in m is saturated, if (m),- = |V; |. 
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4 An operational interpretation of progress measures 

While, from a technical perspective, SPM is a relatively simple algorithm in the sense that it is concise 
and its individual steps are elementary operations, it lacks a clear and appealing explanation of the devices 
used. It is therefore difficult to understand, and possibly enhance. Apart from the formal definition of 
progress measures, little explanation of what is hidden behind the values in tuples is offered. Notable 
exceptions are ||T3]1 . which explains that for D-solitaire games with only even simple cycles, one can use 
the maximal degrees of ‘odd stretches’ (a concept we make precise below) in order to define a parity 
progress measure, and Schewe’s bigstep paper lITSl . where it is shown that dominions of a bounded size 
can be detected using measures with a restricted codomain. Klauck’s construction for a specific parity 
progress measure breaks down for arbitrary parity games and the constructed parity progress measure 
is not related to the measure that is computed by the SPM algorithm, nor to any of the intermediate 
measures. In general, the high-level intuition is that the larger progress measure values indicate more 
capabilities of player □, and a value at a given position in the tuple is somehow related to the number of 
priorities that □ can enforce to visit. 

In what follows, we present a precise and operational interpretation of measures. Our interpretation 
is similar in spirit to the one used in lfT3l . but applicable to all parity games, and uses a concept known 
from the realm of strategy improvement algorithms - a value (or profile) of a play. Here, values are 
defined in terms of maximal odd-dominated stretches occurring in a prefix of a play. With this notion at 
hand, we can consider an optimal valuation of vertices, being the best lower bound on play values that 
player O can enforce, or the worst upper bound that □ can achieve, i.e. it is an equilibrium. The optimal 
valuations range over the same codomain as progress measures, and the main result of this section states 
that the least game parity progress measure is equal to the optimal valuation. 

Let denote all tuples in U {T} such that for all m E and even positions i < d, {m)i = 0; i.e. 
compared to M^, we drop the requirement that values on odd positions i are bounded by |V) |. Elements 
in are ordered in the same fashion as those in M^. Given a play K, a stretch is a subsequence 
71^71^+1... 71^+/ of 71. For a priority k, a k-dominated stretch is a stretch in which the minimal priority 
among all vertices in the stretch is k. The degree of a k-dominated stretch is the number of vertices with 
priority k occurring in the stretch. 

Definition 4 Let us denote all plays in the parity game by IT. An O—value (or simply value) of a play is 
a function Go '■ H defined as follows: 

• if 71 is winning for □, then do{tt) = T 

• if 71 is winning for O, then do{tt) = m, where m ^T, and for every odd position i, {m)i is the 
degree of the maximal /-dominated stretch that is a prefix of 7t 

Observe that a play value is well-defined, as an infinite /-dominated stretch for an odd / implies that a 
game is won by □, and its value is T in such case. 

Example 3 Suppose that the sequence of priorities corresponding to a certain play Tt is 453453213(47)*. 
Then Qo{ti) = (0,1,0,2,0,0,0,0). 

The theorem below links the progress measure values to players’ capabilities to enforce beneficial 
plays or avoid harmful ones, where the benefit from a play is measured by a specially devised play 
value, as it is done in strategy improvement algorithms. This offers a more operational view on progress 
measure values, which, combined with a more fine-grained analysis of the mechanics of SPM allows us 
to extract winning strategies for both players in the next section. 
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Theorem 3 If p is the least progress measure of a parity game G, then, for all v: 

1 . there is a strategy an € §□ such that for every tt e n(an, v), 6o(^) > P(v) 

2. there is a strategy a<y G §o such that for every n G n(ao,v), 6o(?t) < p(v) 

A notable difference between strategy improvement algorithms and SPM is that SPM does not work 
with explicit strategies, and the intermediate measure values do not represent any proper valuation in¬ 
duced by strategies - only the final least progress measure does. Still, these intermediate values give 
an underapproximation of the capabilities of player □ in terms of odd-dominated stretches that she can 
enforce. 

Note that a consequence of Theorem [3] is that the least (resp. greatest) play values that player □ (resp. 
O) can enforce are equal, and coincide with the least game parity progress measure p computed by SPM. 
Observe also that player O can always achieve the strategy guaranteeing the optimal even-biased play 
value with a memoryless strategy, whereas player □ may require to that end a strategy that depends on a 
play’s history. 

5 Strategy construction for player □ 

Computing winning strategies is typically part of a practical solution to a complex verification or a 
controller synthesis problem. In such use cases, these strategies are employed to construct witnesses and 
counterexamples for the verification problems, or for constructing control strategies for the controller |T]. 
As we explained in Section [3l the SPM algorithm can easily be extended to construct a winning strategy 
for player O. The problem of deriving a winning strategy for player □ in SPM (other than by running 
the algorithm on the ‘dual’ game, or by using a ‘dual’ domain M'^) has, however, never been addressed. 
Note that the problem of computing strategies is at least as hard as solving a game. Indeed, even if we are 
equipped with a valid winning partition (Wino(G), Winn(G)) for a game G, then deriving the strategies 
witnessing these partitions involves the same computational overhead as the one required to compute 
(Wino(G), Winn(G)) in the first place. 

Proposition 2 The problem of finding the winning partition (Wino(G), Winn(G)) of a given game G 
can be reduced in polynomial time to the problem of computing a winning strategy for player O in a 
given O-dominion. 

Deriving a strategy for both players by using the SPM to compute measures and measures 
consecutively, or even simultaneously, affects, as we already discussed in Section [3l SPM’s runtime. 
Being able to compute □ strategies without resorting to the aforementioned methods would also allow 
us to potentially significantly improve efficiency on such instances as given by Figure |2] It may be clear, 
though, that extracting a winning strategy from the small progress measures algorithm for vertices with 
measure T will require modifying the algorithm (storing additional data, augmenting the lifting process). 
For instance, simply following the vertex that caused the update to top, fails, as the example below shows. 

Example 4 Reconsider the game depicted in Figure [T] Recall that vertices V 4 ,V 5 and vg are won by 
player □, and in all possible lifting schemes, the first vertex whose measure becomes T is vg. After that, 
a possible sequence of liftings is that first p(v 5 ) is set to T (due to vg), followed by p(v 4 ) = T (due to 
vj). If we set the strategy based on the vertex that caused the given vertex to be lifted to top, we obtain 
a(v 4 ) = V 5 , which is not winning for □. 

The key problem is that for vertices won by player □, from some point onward, the lifting process 
discards significant information. This is best seen in case of lifting to T - a partial characterisation of 
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reachable odd priorities contained in a tuple (see also our previous section) is ultimately replaced with a 
mere indication that player □ can win. 

5.1 A Bounded □-Dominion 

Consider a game G on which a standard SPM algorithm with an arbitrary lifting policy has been applied. 
Suppose that at some point a vertex v is the first one to be lifted to T, and after lifting of v the algorithm 
is suspended, resulting in some temporary measure p. Let k be the priority of v. 

We will start with two straightforward observations. The first one is that k must be an odd number; 
this is because a vertex with an even priority obtains, after lifting, a p-value equal to the p-value of one 
of its successors, and therefore it cannot be the first vertex lifted to T. Another immediate conclusion is 
that at least one (or all, if v E Vo) successor(s) of v has (have) a p-value saturated up to the ^-th position, 

i.e. it is of the form m = (0, |Li |,0, IV 3 1,... ,0, |Vi|, * * *); were it not the case, then a non-top value m' 
such that m' >km would exist, which would be inconsistent with the definition of Prog. 




Figure 3: Snapshot of the SPM algorithm after the first vertex v is lifted to top. 

There are two more complex properties, which we can utilise to modify the SPM algorithm and 
compute the winning strategy for player □ (see Figure O. 

1. Vertex v belongs to an n-dominion D within G such that the minimal priority in D is k. 

2. If V E Vb, then picking the successor u,nax of v with the maximal current p-value among v* is a part 
of a (positional) winning strategy for □ that stays within such a dominion D as described above. 

The intuition concerning the above facts is as follows. Vertices with a measure value m saturated up to 
but possibly excluding a certain position i have a neat interpretation of the measure value at position i: 

Player □ can force the following outcome of a play: 

1. priority i appears mi times without any lower priority in between 

2. the play will reach a T -labelled vertex via priorities not more significant than i 

3. the play enters a cycle with an odd dominating priority larger (less significant) than i. 

Therefore, in the situation as described above, □ can force a play starting at v to first go in one step to the 
successor Umax of V with a measure of the form (0, | Vi |, 0 , | V31, • • •, 0 , |\4|, * * *), and then to play further 
and either force a less significant odd-dominated cycle (cases 2 and 3, since v is the only T-labelled 
vertex), or to visit vertices with priority k\Vk\ times without any lower priority in between. But in the 
latter case, since v has priority k, we have in fact | 14 | -|- 1 vertices with priority k not “cancelled” by a 
lower priority. Hence player □ has forced an odd-dominated cycle with the lowest (most significant) 
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priority k. Note that this does not imply we can simply construct a winning strategy for □ by always 
picking a successor with the maximal measure to further vertices that can be visited along the play; such 
a method may lead to an erroneous strategy, as illustrated by Figure IH 


( 0 , 2 , 0 , 0 ) 


( 0 , 2 , 0 , 1 ) 

Figure 4: A game, won entirely by player □, and demonstrating that a strategy defined by a greedy 
choice of vertex with the maximal tuple does not work. After lifting the vertices in order vi,V 3 ,V 2 ,V 4 ,vi, 
we obtain the measure values as above. Player □ would then choose V 3 , which leads to a losing play, 
whereas the choice of the other successor (V 4 ) yields a winning play for □. 

Propagating a top value only to vertices with less significant priorities is, however, safe. This can 
be achieved efficiently by a slightly modified affractor thaf works within a given context of vertices W, 
which we call a guarded attractor. 

Definition 5 Let k be some priority and U ,W some sets for which U C IT n V>k. Then U-Attr^^{U) is 
the least set A satisfying: 

1 . U <ZAQWr\V>k 

2. (a) if M G Vn and u* HA / 0, then n G A 
(b) if M G Vo and u* GW C A, then n G A 

If IV = V, we drop the subscript IV from the guarded attractor. 

The theorem below forms the basis of our algorithm; it describes the relevant information about an □- 
dominion that can be extracted once the first vertex in the game is lifted to top. 

Theorem 4 Let G be a parity game on which an arbitrary lifting sequence is applied, such that at some 
point a vertex v with ^^{v) = k is the first vertex whose measure value becomes top. Let p be the 
temporary measure at that point. The following holds: 

• if V G Vb, then for every successor n of v with a maximal measure among v* there is an D-dominion 
Du containing □-Affr-^({v}) such that for all w G Du, 3^{w) > k. Moreover, □ has a winning 
strategy that is closed on D„, and which is defined on v as a(v) = u, and on U-Attr-^{{v}) \ {v} 
as the strategy attracting towards v, 

• if V G Vo, then there is an □-dominion D containing U-Attr-^{{v}) such that for all w G D, £P{w) > 
k. Moreover, □ has a winning strategy a that is closed on D, and defined on D-Attr-^dv}) \ {v} 
as the strategy attracting towards v. Note that in this case v* C D. 

5.2 The Extended SPM Algorithm 

Theorem IHcaptures the core idea behind our algorithm. It provides us with the means to locally resolve 
{i.e. define a local strategy for) at least one vertex in Winn(G), once a top value is found while lifting. 
Moreover, it indicates in which part of the game the remainder of the D-dominion resides, implying that 
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Algorithm 2 Modified SPM with strategy derivation for player Odd 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 


function Solve(G) 

Input G = (V,E,^, (Vo,Va)) 

Output Winning partition and strategies ({Wino(G),o'), (Win\j(G),(j)) 
initialise a to an empty assignment 
p ^ AwEP. (0 ,...,0) 

SPM-Within(1/) 

compute strategy a' for player Even by picking min. successor w.r.t. p 
return (({vEE | p(v) / T}, a'), ({v E E | p(v) = T,a)) 

procedure SPM-Within(IE) 
while iyv / 0) do 

while p C Lift(p,w) for some w €W and for all w E W:p{w) / T do 
p ^ Lift(p,w) for w E VE such that p C Lift(p,>v) 

end while 

if for all w E kE: p(w) / T break end if 
V •(— the (unique) vertex in W such that p (v) = T 
^(v) 

a(v) •(— u where u E v* n VE for which p(u') <k p(u) for all u' E v* n VE 
RES ^ U-Attrl\{v]) 
for all w E RES \ {v} do 

p(w) ^ T 

if w E Vb then assign a(>v) the strategy attracting to v end if 

end for 

DOM ^ RES 

IRR 0-Attr]v({w E kE | liP{w) < /:}) 

REM ^kE\(RESUIRR) 

SPM-Within(REM) 

DOM ^ DOM U {w E REM | p(w) = T} 

A ■<— □-Affrvy(DOM) 

for all w E A \ DOM do 

p(w) T 

if w E kn then assign a{w) to be the strategy attracting to DOM 

end if 
end for 
W ^IE\A 
end while 
end procedure 
end function 


one can temporarily restrict the lifting to that area until the dominion is fully resolved. We will give a 
description of our solution (Algorithm and informally argue the correctness of our approach. Eor a 
(intricate and rather involved) formal proof, we refer to f9|. 

The algorithm proceeds as follows. Eirst, a standard SPM is run until the first vertex reaches top [1. 
[T2mA] in Alg. |2]]. Whenever v is the first vertex lifted to top, then the issue of the winning strategy for 
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V can be resolved immediately [1. [18]], as well as for vertices in the guarded attractor of v (if there are 
any). We will denote this set of ‘resolved’ vertices with RES. Moreover, we can restrict our search for 
the remainder of the D-dominion D only to vertices with priorities not more significant than k, in fact 
only those from which player O cannot attract a play to visit a priority more significant than k. Hence we 
can remove from the current computation context the set IRR = 0-Attr{{w E W \ ^(w) < k}), vertices 
that may be considered at the moment irrelevant [1. 125i - l26]l . 

After discarding the resolved and currently irrelevant vertices, the algorithm proceeds in the remain¬ 
ing set of vertices that constitutes a proper subgame (i.e. without dead ends) induced by the set REM. 
After the subroutine returns [1. [27]], all vertices labelled with top are won by player □ in the subgame 
Gn REM. In other words, those vertices are won by □ provided that the play does not leave REM. Since 
the only way for player O to escape from REM is to visit RES, where every vertex is won by player 
□, the top-labelled vertices from REM are in fact won by □ in the context of the larger game GDW. 
Therefore the set DOM computed in line [28] constitutes an D-dominion within the game GCiW, in which 
we have moreover fully defined a winning sfrafegy a for player □. Finally, every verfex from F \ DOM 
fhaf can be affracfed by player □ fo fhe dominion DOM is cerfainly won by □, and for fhose vertices we 
assign fhe sfandard sfrafegy attracting fo DOM. The D-dominion A is fhen removed, and fhe compufafion 
confinues in fhe remaining subgame. 

The algorifhm may af firsf sighf appear fo deviafe much from fhe sfandard SPM algorifhm. However, 
fhe addifional overlay, apart from defining fhe sfrafegy, is no more fhan a special tiffing policy fhaf 
femporarily resfricfs fhe lifting fo parfs where an D dominion resides. 

Every affracfor compufafion fakes 0{n-\-m) fime, and whenever if occurs, af leasf one new verfex is 
‘resolved’. Hence fhe fofal exfralime infroduced by fhe affracfor compufafions is bounded by 0{n{n+m)). 
As wifh fhe sfandard SPM, fhe tiffing operations dominafe fhe running fime, and fheir fofal number for 
every vertex is bounded by fhe size of M^. 

Theorem 5 For a game G wifh n vertices, m edges, and d priorifies, SOLVE solves G and computes 
winning sfrafegies for player ^ and D in worsf-case 0{dm ■ (n/[if/2j ). 


6 Illustrating the new algorithm 

We illusfrafe fhe various aspecfs of Algorifhm |2] on fhe game G depicted in Figure [5l wifh fwo (over¬ 
lapping) subgames G\ and G 2 . Nofe fhaf fhe enfire game is an D-paradise: every verfex evenfually is 
assigned measure T by Algorifhm |2] (and Algorifhmd] for fhaf matter). Suppose we use a tiffing sfrafegy 
prioritising V 2 ,V 3 ,V 7 and vg; fhen vertex V 3 ’s measure is fhe firsf fo reach T, and fhe successor wifh maxi¬ 
mal measure is vy. Therefore, D’s sfrafegy is fo move from V 3 fo vy. The sef RES, compufed nexf consisfs 
of vertices V 3 and vy; fhe sfrafegy for vy is sef fo V 3 and ifs measure is sef fo T. The O-affracfor info fhose 
vertices wifh priorifies > 3, i.e., vertices vi and V 4 , is exacfly fhose vertices, so, nexf, fhe algorifhm zooms 
in on solving fhe subgame Gi. 

Suppose fhaf wifhin fhe latter subgame, we priorifise fhe tiffing of verfex vy and vs; fhen verfex vy’s 
measure is sef fo T firsf, and vy’s successor wifh fhe largesf measure is vg; fherefore D’s sfrafegy is fo 
move from vy fo vg. Af fhis poinf in fhe algorifhm, RES is assigned fhe sef of vertices vy and vg, and 
fhe measure of vg is sef fo T. Nofe fhaf in fhis case, in fhis subgame, fhe winning sfrafegy for D on 
Vy is fo remain wifhin fhe sef RES. Since all remaining vertices have more signficanf priorifies fhan 
Vy, or are forced by O fo move fhere, fhe nexf recursion is run on an empfy subgame and immediafely 
refurns wifhouf changing fhe measures. Upon refurn, fhe D-affracfor fo all D-won verfices (wifhin fhe 
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Figure 5: An example game G with two (overlapping) subgames Gi and G 2 . 


subgame Gi, so these are only the vertices v? and vg) is computed, and the algorithm continues solving 
the remaining subgame (i.e. the game restricted to vertices V 5 ,V 6 and vg), concluding that no vertex in 
this entire game will be assigned measure T. 

At this point, the algorithm returns to the global game again and computes the D-attractor to the 
vertices won by player □ at that stage {i.e. vertices V 2 ,vj,,v-j and vg), adding vertices v\ and vg, setting 
their measure to T and setting D’s strategy for vg to move to vi. 

As a final step, the algorithm next homes in on the subgame G 2 , again within the larger game. The 
only vertex assigned measure T in the above subgame is vertex V 4 ; at this point RES is assigned all 
vertices in G 2 , the measure of V 5 and vg is set to T and the □ strategy for vertex V 5 is set to V 4 . This 
effectively solves the entire game. 


7 Conclusions and Future Work 

In this paper, we studied the classical Small Progress Measures algorithm for solving parity games. The 
two key contributions of our work are as follows: 

1. We have proposed a more operational interpretation of progress measures by characterising the 
types of plays that players can enforce. 

2. We have provided a modification of the SPM algorithm that allows to compute the winning strate¬ 
gies for both players in one pass, thus improving the worst-case running time of strategy derivation. 

The second enhancement has been made possible due to a thorough study of the contents of progress 
measures, and their underapproximations in the intermediate stages of the algorithm (building on the 
proposed operational interpretation). 

As for the future work, we would like to perform an analysis of SPM’s behaviour on special classes 
of games, along the same lines as we have done in case of the recursive algorithm fS]. More specifically, 
we would like fo idenfify fhe games for which SPM runs in polynomial lime, and sludy enhancemenfs 
lhal allow lo solve more fypes of games efficienfly. If would also be interesting fo see whefher fhe ideas 
behind our modification of fhe SPM algorilhm carry over fo fhe algorilhm for small energy progress 
measures for mean payoff games. 
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